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The financial market is a complex dynamical system composed of a large variety of intricate 
relationships between several entities, such as banks, corporations and institutions. At the heart of 
the system lies the stock exchange mechanism, which establishes a time-evolving network of trades 
among companies and individuals. Such network can be inferred through correlations between 
time series of companies stock prices, allowing the overall system to be characterized by techniques 
borrowed from network science. Here we study the presence of communities in the inferred stock 
market network, and show that the knowledge about the communities alone can provide a nearly 
complete representation of the system topology. This is done by defining a simple null model, a 
randomized version of the studied network sharing only the sizes and interconnectivity between 
communities observed. We show that many topological characteristics of the inferred networks are 
carried over the networks generated by the null model. In particular, we find that in periods of 
instability, such as during a financial crisis, the network strays away from a state of well-defined 
community structure to a much more uniform topological organization. We show that the framework 
presented here provides a good null model representation of topological variations taking place in 
the market during crises. Also, the general approach used in this work can be extended to other 
systems. 

I. INTRODUCTION 

The quantitative analysis of real complex systems is 
mostly focused on the study of time series, since the full 
knowledge of the underlying equations of a given dynamic 
system is hard or usually impossible to obtain from static 
data m- This situation is frequently found in the anal¬ 
ysis of a number of applicable domains, including finan¬ 
cial markets [5], physiological data [6], climate model¬ 
ing [3 and several other systems [7]. 

Many tools based on nonlinear dynamics (e.g., fractal 
dimensions, Lyapunov exponents and recurrence proper¬ 
ties) have been developed to analyze time series origi¬ 
nating from complex systems [7]. However, besides such 
conventionally studied approaches, recently, concepts of 
network science have been applied to this problem too. 

Generally, most available methods map time series into 
the network domain, so that a set of network measure¬ 
ments can be used to calculate the statistical proper¬ 
ties of the system. These methods can be quite different 
and include cycle networks [8], transition networks [9], 
fc-nearest-neighbors uni, visibility-graphs m and also 
recurrence networks [HUS] which is a natural extension 
of the traditional recurrence analysis of dynamical sys¬ 
tems [HI 15]. Ref. [13] provides a brief review of the anal¬ 
ysis of time series using complex networks techniques, 
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also comparing the approaches and pointing to possible 
pitfalls and limitations of their applications. 

The aforementioned methods analyze a single time se¬ 
ries using complex networks. However, one could also 
tackle complex systems consisting of sets of time series, 
rather than just a single sample. Examples of such sys¬ 
tems include the stock market in which each asset has 
a time evolving market price (SHU]; temporal spatio- 
grid climate data, where each grid point has a time se¬ 
ries associated to a given climate variable (e.g., temper¬ 
ature, wind velocity, etc. ) [22H2S]; and also grid tempo¬ 
ral data collected from electroencephalograms (EEG) or 
functional magnetic resonance images (fMRI) [26, 27]. 
Each of these systems are represented by discrete ele¬ 
ments whose physical interactions can be inferred from 
the corresponding time series. 

Given their discrete nature, the motivation for using 
the network approach to analyze such systems is nat¬ 
ural. Network techniques usually establish the connec¬ 
tions (interactions) by quantifying the statistical similar¬ 
ities between each discrete element, yielding a so-called 
functional network |26| . Moreover, the complex network 
approach to complex systems also provides an effective 
and comprehensive set of visualization tools, which al¬ 
lows important insights on the relation between the net¬ 
work structure and the underlying system behavior. The 
benefits of network visualization can be realized, for in¬ 
stance, in the study of biological [291152] , social [551 34] 
and transportation networks [35], among others [54H56] , 

Naturally, these functional networks can evolve with 
time, and techniques can be developed for network con¬ 
struction in order to analyze the underlying complex sys- 
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tem. In such an approach, the main objective is the de¬ 
tection of extreme events that significantly modify the 
network structure, which can in turn be related to crit¬ 
ical events in the underlying complex system. For in¬ 
stance, in the temporal analysis of climate networks one 
important issue is detecting extreme events caused by El 
Nino-Southern Oscillation (ENSO) [24l [25l I37H42] or the 
occurrences of Monsoons gam], in financial market 
networks, the effect of financial instabilities in the clus¬ 
ter organization of stocks is of interest mmmmm- 
In each case, the occurrence of extreme events is inferred 
from the detection of anomalies in the time series origi¬ 
nating from the network evolution. 

In this paper, we compare the set of time series ob¬ 
tained from the evolution of the inferred network with 
those obtained from a null model of the same network, 
in which only information about its communities is used. 
By null model we understand a family of randomized 
networks reproducing a subset of features of the original 
networks m- The methodology outlined in our paper 
is described in detail in Fig. [l] Using a financial market 
data set, we obtain the a time-evolving network (step a), 
which is inferred from the Pearson correlation coefficient 
between company daily closing stock values. A set of 
time series of topological measurements is obtained from 
the evolving network (step b), and used to characterize 
the evolution of network structure (step c). Using the 
widely documented crisis periods (step d), we evaluate 
the extent of which topological features correctly reflects 
the changes in network structure during different crises 
(step e). 

In order to create a community null model of the evo¬ 
lution of the stock market network, a set of community 
properties is obtained (step f). At each time step, a 
random network with same average degree and commu¬ 
nity structure as the corresponding inferred network is 
created using a stochastic blockmodel [52], generating a 
time-evolving community network (step g). Finally, once 
the time series of topological measurements are extracted 
(steps h and i), the network evolution derived from the 
random network with communities can be compared with 
the time series of topological measurements of the in¬ 
ferred network (step j). Such comparison needs a proper 
scale to allow us to conclude how similar the network 
generated from the community model is to the inferred 
network. This is done by using a well-known random 
network model, known as configuration model [34] . to 
also represent the inferred network. That is, the degree 
distribution of the network at each day is obtained (step 
k), and used to generate daily networks having the same 
degree distribution as the inferred ones (step 1). Topo¬ 
logical features of these networks are obtained (steps m 
and n) and compared with the same features obtained 
for the inferred networks (step o). If the community null 
model can provide a more suitable representation of the 
inferred networks, we expect the comparison done at step 
j to display a closer agreement between the time series 
than the comparison done at step o. 



FIG. 1: (Color online) Schematic representation of the differ¬ 
ent steps of our methodology. Please refer to the text for an 
explanation about each step. 


The suggested analyses for the stock market networks 
are built upon the stochastic blockmodel and are based 
solely on the community structure of the time evolving 
inferred network. As far as we know, our approach is dif¬ 
ferent from other approaches based on network commu¬ 
nity, as most of them are focused on testing community 
detection algorithms 53 55 . While more sophisticated 
community models, such as the degree-corrected block- 
model [52] . could also have been used, they would incor¬ 
porate not only the community structure but also other 
characteristics of the original system, such as degree dis¬ 
tribution and average path length. Therefore it could be 
difficult to disentangle the effects of these properties on 
the results. It is important to highlight that the pur¬ 
pose of the current work is not to predict the behavior 
of the financial market network, but to understand how 
informative is the community structure when traditional 
topological features are used to analyze such networks. 
This is performed by comparing a set of properties be¬ 
tween the inferred networks and the respective networks 
generated by the null model. Such null model can then 
be used to isolate the influence of community structure 
changes on other topological properties, such as assor- 
tativity and transitivity. Therefore, it provides a precise 
means to verify which properties are a trivial consequence 
of variations in the community structure. 
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We validate our framework by analyzing functional 
networks constructed through statistical similarities be¬ 
tween stocks traded at the New York Stock Exchange 
(NYSE). We show that the financial crashes are char¬ 
acterized by the presence of well-defined changes to the 
community structure, whereas outside these critical pe¬ 
riods the network topology is composed by communi¬ 
ties that remain stable for long periods. Accordingly, 
we found that the observed values of many topological 
properties of the financial market network are a direct 
consequence of the community structure of the system. 

This paper is organized as follows. In Sec. II we spec¬ 
ify how the time evolving network of the financial market 
is inferred, and describe some basic community changes 
observed in the network during a single crisis period. In 
Sec. Ill we present the methodology used to recover the 
market topology using daily mixing matrices. We high¬ 
light the relevance of the community structure for the 
financial market characterization. Finally, in Sec. IV we 
present the conclusions of the study. 

II. THE TIME-EVOLVING STOCK MARKET 
NETWORK 

The stock market database consists of daily prices of 
3799 stocks traded on the New York Stock Exchange. 
The stocks prices were obtained from the Yahoo! finan¬ 
cial database (http://finance.yahoo.com). We selected 
N = 348 stocks from this set, which are the stocks for 
which there are historical data from January 1986 to 
February 2011. For these stocks, we obtained C v = 6008 
closures prices per stock over the trading period. 

In the financial market networks analyzed in this pa¬ 
per, the nodes correspond to stocks and the links quantify 
the statistical similarity between the time series associ¬ 
ated to the stock closure prices evolution. In particular, 
in order to quantify the similarity between two time se¬ 
ries, we adopt the Pearson correlation coefficient 

_ {YiYj} — (Yi) (Yj) 
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where V(£) is the logarithm of return, i.e., V(t) = 
In Pi(t) — In Pi(t — 1) is the price return, where Pi(t) is 
the closing price of the i-th stock at day t. The ad¬ 
vantage of using Y(t) is that nonlinear and stochastic 
transformations are not needed in order to correct for 
some common trends in the data [56]. In addition, Y(t) 
is independent of inflation or discount factors. By cal¬ 
culating Eq. [l] between all pairs of stocks we obtain a 
fully connected weighted graph in which the link weights 
are given by pij. However, in order to analyze the net¬ 
work structure according to the stronger connections, 
we discard links whose weights are below a threshold e. 
This leads to a network defined by the adjacency ma¬ 
trix Aij = Q(pij — e) — Sij , where ©(•) is the Heaviside 
function and 5^ the Kronecker delta. 


The analysis of the time evolution of the inferred fi¬ 
nancial market network is illustrated in Fig. [2] First we 
set a time window of length At = 30 days inside which 
the correlations between stocks are calculated creating 
the network. Next, by sequentially sliding the window 
by St = 1 we are able to generate a sequence of networks 
corresponding to the evolution of the market. More pre¬ 
cisely, the first network A/^ 1 ) is constructed by calculating 
the correlations of the time series between t^ = 1 and 
t^ = 31, the second network AA 2 ) is constructed con¬ 
sidering data between instants = 2 and t^ = 32, 
and the n-th network encompasses the prices between 
t ^ = l-\-(n — l)St and t^ = t^ + At. Therefore, n rep¬ 
resents the index of the analyzed business day and takes 
value in the range [1 ,N W ], where N w = C p — At. The 
threshold e t used to define which edges are kept changes 
for each day. The value of et is set so that / = 10% of the 
possible N(N — l)/2 edges are kept at each day. That 
is, the average degree of the network is fixed. In Fig. SI 
of the supplementary material we show the evolution of 
modularity for different choices of /. It is clear that there 
are no significant changes in the results if / belongs to 
the range [5%, 20%]. 

In order to better understand the evolution of the fi¬ 
nancial market network, it is useful to first visualize how 
its structure is organized near critical points. A useful 
and clearly defined event that can be used as a refer¬ 
ence for the effect of financial instabilities in the network 
structure is the famous Black Monday crisis, which oc¬ 
curred in October 19, 1987 mm- After employing the 
methodology described above for network construction, 
the community structure of the network at each business 
day was found using the multilevel community detection 
algorithm [58]. The resulting modularity of the detected 
communities is shown in Fig. [3] We note that distinct 
connected components of the network are always assigned 
to different communities. In Fig. [3] we also show network 
visualizations corresponding to four different instants of 
time, where each node color represents a different com¬ 
munity. Furthermore, in order to correctly observe the 
communities evolution, the community membership of 
each node is calculated only in network A of Fig. [3j so 
that the community assignment is kept fixed for the vi¬ 
sualization of networks B, C and D. 

From the figure, it is clear that before the crisis the 
modular structure is mainly composed of two predomi¬ 
nant communities. As the network approaches the cri¬ 
sis the network changes drastically, and the community 
structure substantially vanishes. Only a highly connected 
cluster at the center of the network remains. Note that 
since we fix the average degree, the number of edges 
in networks A and B is the same, only the connectiv¬ 
ity is changed. At this epoch, most stocks are discon¬ 
nected, meaning that the prices evolve without strong 
correlations. During the crisis, the remaining connected 
components exhibit a more homogeneous structure, as 
suggested by the lower values of modularity. Similar 
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FIG. 2: (Color online) Diagram illustrating the method to construct the financial market networks. The network is constructed 
by calculating the correlations between the stocks returns Yi (i — 1,2,...,N) inside a time window of length At. Next, by 
shifting this time window by amounts St until the end of the database is reached, we obtain the network evolution. 



Before crisis During crisis After crisis Far away from crisis 



FIG. 3: (Color online) Modularity of the NYSE networks in distinct times during and around the Black Monday crisis [2T1I57] . 
We also show a visualization of the network at four specific days. Node colors correspond to the community structure found 
for network A. We note that the average degrees of networks A, B, C and D are the same. 
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repercussions of the Black Monday were reported in the 
temporal analysis of the Minimum Spanning Tree (MST) 
constructed from the NYSE data ehe> 7]. The authors 
observed that the asset tree shrinks, i.e. the normalized 
tree length decreases, reducing the topological distances 
between the stocks. This result also agrees with other 
findings on the structural organization of financial mar¬ 
ket networks [46, 159U63] . It is also interesting to note in 
Fig-1 that, throughout the considered period, the com¬ 
munities preserve most of their membership composition, 
even in very long periods after the crash, when the net¬ 
work becomes reconnected again. 

In order to quantitatively investigate the relationship 
between a financial crisis and network community struc¬ 
ture, we analyze a set of well known crisis periods. These 
periods are marked alongside the curve of the modularity 
in Fig. [4j for all business days of our dataset. We also 
show in the same plot the evolution of the average short¬ 
est path length of the network. This measurement seems 
to be less sensitive to fluctuations outside crisis periods, 
and therefore is able to provide additional confirmation 
that there are significant structural changes during crises. 
For each considered crisis, we observe an increase of mod¬ 
ularity around the time span of the crisis. This indicates 
that indeed the modularity can capture topological net¬ 
work properties related to the financial crisis occurring 
in the system. For instance, the 1990 —)> 1991 recession 
seems to be closely related to the period of high values 
of modularity between mid 1990 and early 1991. 

An important aspect of analyzing the modularity of 
time-evolving networks is that changes in modularity can 
be caused by two distinct factors. It may be due to 
changes in the community membership of the nodes or 
because of changes in the network topology. In order to 
study the relative impact of both of these aspects, we 
define three types of modularity, namely dynamical , fixed 
and lagged. For the dynamical modularity , at each time 
step node memberships are recalculated using the multi¬ 
level community detection algorithm |58|. Therefore the 
modularity reflects the best network partition found by 
the algorithm at each day. In Fig. [5] we show in blue 
the evolution of the dynamical modularity for the en¬ 
tire time series. We see that this measurement contains 
spikes, that is, increased values of dynamical modularity 
during short periods, while there is no long-time variation 
of the modularity. Although the dynamical modularity 
is a useful indicator of the overall association between 
stock prices, it provides no information about the actual 
changes occurring inside the communities. For example, 
the membership of the nodes can change along time steps 
without modifying the dynamical modularity. 

Therefore, we also characterize the community evolu¬ 
tion of the financial market by using a fixed modularity 
defined as follows. The community structure found on 
the first time step defines the membership of the nodes 
for all time steps, and the modularity is calculated based 
on this membership. In Fig. [5] we show in orange the 
evolution of the fixed modularity. It is likely that, over 


a long time, stocks may evolve into different communi¬ 
ties, so fixed modularity tends to decrease with time. It 
is clear that spikes are less pronounced than when using 
the dynamical modularity. Also, there is a clear transi¬ 
tion of the fixed modularity around the year 2002, which 
means that there was a noticeable change of the financial 
market at this epoch. We will discuss this transition on 
the following section. Nevertheless, an obvious problem 
with the fixed modularity is that there is nothing to dis¬ 
tinguish the starting point of the time series, where the 
community composition and node membership was cal¬ 
culated. This problem can be solved by defining a lagged 
modularity , in which the memberships are calculated at 
time t — t a and used to determine the modularity at time 
t. If £a = 0? we simply recover the dynamical modularity. 
For large £a the values tend to the fixed modularity, since 
when t — t& < 0 we consider the memberships calculated 
in the first time step. To some extent, the value £a is 
related to the memory of the market, that is, the extent 
to which its structure changed during the interval t/\. In 
Fig. [5]we show the lagged modularity for = 100 busi¬ 
ness days. In Fig. S2 of the supplementary material we 
present the lagged modularity for different values of £a- 
In a manner similar to the fixed modularity, the spikes 
of the lagged modularity are also less pronounced than 
when using the dynamical modularity. This means that 
although the market contains pronounced communities 
during the spike periods, the composition of the commu¬ 
nities are varying with time. 


III. TOPOLOGY CHARACTERIZATION FROM 
DAILY MIXING MATRICES 

In the stock exchange system, companies are typically 
organized in several sectors (i.e. industry, services, bank¬ 
ing, etc.) which may possess very distinctive relationship 
patterns and time series behavior. The inferred networks 
are also expected to exhibit a modular organization based 
on node sectors. In the previous section, we showed qual¬ 
itatively that the community structure on the inferred 
networks may be related to periods of crisis. 

To further investigate the importance of the commu¬ 
nity structure in this system, we use a stochastic block- 
model to generate networks that preserve only informa¬ 
tion about the community structure of the inferred net¬ 
works. Starting from the set of inferred networks ob¬ 
tained from the NYSE dataset, for each time step t we 
obtain the community structure C(t ) by employing the 
multilevel community detection method proposed in |)8j . 
From the resulting partitioning of the nodes, we obtain a 
set of community mixing matrices II (t). The elements in 
U a p(t) correspond to the number of connections between 
each pair of communities (a, /3) at time £, where a G C(t) 
and (3 G C(t). Additionally, the number of nodes within 
each community cq |a|, is also considered. We define our 
null model in terms of a sequence of stochastic mixing 
matrices, i r(£), that can be expressed by a normalization 


6 







*S I HS I Hg | £*§ | £^3 | £*g | £^g | £^g | £^g | Hg | £^g | £3.g | $5§ | £^g | H§ | Hg | £sg | £4g | £*£ | £^g | £3g | £^g | £sg | £^g | 

1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 


FIG. 4: (Color online) Modularity (blue) and average shortest path length (orange) of the NYSE networks for all businnes days 
in our dataset. Known periods of crisis and bear market trend^] [5H are shown. We considered only crises that had influence 
on the US stock market. Note that the dates specified by the source are approximate and, in general, the duration of a crisis 
is not well-defined in the literature, with the exception of marked crises such as the Black Monday. 

c http://www.wsj.com/articles/SB119239926667758592 
http: //www. telegraph. co. uk/f inance/2773265/Billionaire-who-broke-the-Bank- of -England. html 
http://www.guardian.co.uk/technology/2005/may/16/media.business 

http://news.bbc.co.uk/onthisday/hi/dates/stories/april/26/newsid_2503000/2503271.stm 
http://www.theguardian.com/world/2003/apr/09/russia.artsandhumanities 
Retrieved 09 May, 2015 
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FIG. 5: (Color online) Evolution of the dynamical modularity, fixed modularity and lagged modularity of the stock market 
network with time. Crisis periods are shown in a manner similar to Fig. [4] 
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The parameter i r(t) corresponds to the same probabil¬ 
ity of connection between groups on the simplest block- 
model [52]. However, here we took a different approach 
to the generative model, we apply a wiring process more 
similar to the way that simple Erdos Renyi networks are 
constructed. For each instant, a network is initially gen¬ 
erated as a fully disconnected graph with ^JaeC(t) M 


nodes. Each node is assigned to a community a E C(t) 
in agreement with the given community sizes, \a\. Next, 
pairs of nodes (z, j), with community memberships z £ a 
and i G ft are connected according to the probability 
7 T a p(t) . The resulting networks contain communities 
formed by uniform random graphs (similar to the Erdos 
Renyi model), and the only characteristic they share a 
priori with the inferred networks is the community struc¬ 
ture. Since we recalculate the communities at each time 
step, the topology is based on the daily mixing matrices 
of the inferred networks. 

The remainder of this section is devoted to comparing 
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the topological properties obtained from our community 
null model with those obtained for the inferred networks. 
In addition, we also compare our results with those ob¬ 
tained from networks generated using the configuration 
model [34]. This model is widely used in the literature to 
generate a null model of the network under study. This 
is so because the generated networks are guaranteed to 
have the same degree distribution as the original ones, 
while any other structural property not related to indi¬ 
vidual node degrees is the same as in an uniformly ran¬ 
dom network. Our aim is to uncover in which conditions 
the community null model might be a better descriptor 
of the inferred network topology. 

We measured eight topological characteristics [65] , 
namely dynamical modularity m, average shortest path 
length [66j, degree assortativity m, transitivity [68], av¬ 
erage betweenness [69], clique number EQ!> righ-club co¬ 
efficient EDH2 and average matching index [65|. The re¬ 
sults for the first four mentioned measurements are shown 
in Fig. [6] For brevity, the plots comparing the remain¬ 
ing network characteristics are included in Fig. S3 of the 
supplementary material. 

The dynamical modularity, shown in Fig.J^al), is mea¬ 
sured to provide a confirmation that our null model is in¬ 
deed capturing the community structure of the inferred 
networks. As expected, the modularity of the networks 
generated by only fixing the degree distribution, shown 
in green, is usually low. The only period where such 
networks display noticeably large values of modularity is 
during the Black Monday crisis, which happens because 
the network becomes highly disconnected in this partic¬ 
ular crisis. In order to provide a quantitative compari¬ 
son of the time series, we measure the average squared 
difference between the calculated values for the inferred 
network and the community and degree distribution null 
models. The result is shown as a table in the upper right 
corner of the plot. In Fig. [6j]a2) we show a scatter plot 
between the modularity observed in the inferred network 
and the modularity of the respective networks generated 
from the community and configuration models. The blue 
line indicates a y = x relationship, which would be the 
optimal agreement between the inferred and artificial net¬ 
works. As noted before, there is an apparent transition 
of market behavior around the year 2002. Therefore, we 
also divide the whole series into two intervals, the period 
1987 —>> 2002, represented by darker markers, and the 
period 2002 —>> 2011, represented by lighter marks. In 
the same figure, we also show the Pearson correlation co¬ 
efficient between the measurements for both periods. It 
is useful to measure both the average squared difference 
and the Pearson coefficient because while the former con¬ 
siders only the absolute difference between the measure¬ 
ments, the latter indicates how well the two properties 
vary together. This is useful to account for cases where 
the two properties display a large difference in absolute 
value, but their changes along time are still related. In 
such cases, the null model will still reflect sudden changes 
in the inferred network topology during crises. 


In Fig.^bl) we show the average shortest path length 
of the inferred and generated networks. This measure¬ 
ment does not change significantly with time, except 
at epochs where the modularity increases, since a more 
modular network tends to give on average larger shortest 
distances. It is also clear that there is a larger deviation 
from the null model after the year 2002, although the 
variations between different periods are still represented 
by the generated networks, as indicated by the large 
Pearson correlation coefficients indicated in Fig. (6^b2). 

The degree assortativity of the networks is shown in 
Fig.gc 1), where again there was a significant relation¬ 
ship between the community null model and the inferred 
network. The assortativity of a uniformly random net¬ 
work should be close to zero, since there is no preferen¬ 
tial connectivity between nodes of similar degree. There¬ 
fore, one mighty conclude that our null model should 
produce non-assortative networks, but the community 
description of a network is so effective that it also con¬ 
strains the degree assortativity. For example, if one of 
the network communities is denser than the remaining 
communities, then the nodes belonging to this commu¬ 
nity will have large degrees and have a tendency to be 
more strongly interconnected. This increases the assor¬ 
tativity of the network. The scatter plot comparing the 
degree assortativity between the inferred and modeled 
networks (Fig. [6^c2)) confirm this idea. The community 
null model follows closely the y = x line, specially in the 
1987 —>> 2002 period. 

The transitivity (shown in Fig. [6jdl)) is the charac¬ 
teristic that displays the largest absolute deviation be¬ 
tween the inferred and artificial community networks, 
when compared to the respective fixed degree distribu¬ 
tion networks. This occurs because the transitivity of 
uniformly random networks tends to zero for large net¬ 
work sizes [65]. Since the inferred networks actually have 
a non-zero transitivity, the null model will usually have a 
lower transitivity values than for the inferred case. Nev¬ 
ertheless, the transitivity variations between time steps 
are still carried over the simulated networks, since the 
community structure captured by the null model also 
has a strong influence on the transitivity. This is con¬ 
firmed by the large Pearson correlation coefficients ob¬ 
tained for the scatter plot shown in Fig. [6](d2) . It is in¬ 
teresting that the degree distribution alone can provide a 
good representation of the inferred networks transitivity 
in the 2002 —>• 2011 period, while the community model 
presents a strong deviation in the transitivity values. 

In order to summarize the results obtained for the eight 
measurements calculated over the inferred and artificially 
generated time-evolving networks, we apply the Princi¬ 
pal Component Analysis technique m to the data. The 
original 8-dimensional space spanned by the obtained 
network measurements is projected onto a 2-dimensional 
one, defined by the first two PCA components. The re¬ 
sult is shown in Fig. [7] We observe that the first princi¬ 
pal component (PCA1) is strongly related to changes in 
the network structure along time, since the main axis of 
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FIG. 6: (Color online) Dynamical modularity, average path length, degree assort at ivity and transitivity obtained for the NYSE 
networks (blue), configuration null model (green) and community null model (orange). Plots on the left show the time evolution 
of the measurements. Plots on the right show the scatter plot between each measurement obtained from the inferred networks 
and the artificially generated networks. The blue line indicates a y — x relationship. The average squared differences and 
Pearson correlation coefficients between the inferred and simulated time series are shown as tables included inside each plot. 
Both measurements are calculated separately for the pi 987->-2002 and ^ 2002^-2011 periods. 
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dispersion in the original 8-dimensional space is a conse¬ 
quence of the network evolution. Therefore, the models 
should be compared according to the second principal 
component (PCA2). Thus, it is clear that the commu¬ 
nity null model carries over many characteristics of the 
inferred networks. Keeping only the daily degree distri¬ 
bution of the inferred networks seems to be an inefficient 
method for reproducing the original network topology. 
By inefficient we mean that storing the entire daily degree 
distribution of the network requires much more informa¬ 
tion than storing the daily mixing matrix describing the 
community structure. 



First principal component (PCA1) 


FIG. 7: (Color online) PC A obtained from all considered mea¬ 
surements for the inferred networks (blue), the community 
preserving null model (orange) and by fixing the degree dis¬ 
tribution (green). 


As a final example, in Fig. [8] we visually compare the 
inferred networks with those generated by our null model. 
The inferred networks were sampled every 1000 time 
steps between 10 May 1991 and 26 March 2007. To con¬ 
struct the visualization we use a force-directed method 
based on the Fruchterman-Reingold algorithm [74] for 
which only the network topology is taken into account, 
hence no information about the communities is consid¬ 
ered aside from nodes colors. Again, we observe that 
the networks constructed from the mixing matrices are 
able to qualitatively describe the topology of the inferred 
financial networks. 


IV. CONCLUSIONS 

The study of the evolving financial market network is 
of great interest for many reasons. Besides improving 


decisions related to industrial development and overall 
national growth, a reliable description of the evolution 
of a financial market can provide indicators of an immi¬ 
nent widespread stock value decline, which we refer to 
as a stock market crisis. In order to infer the underly¬ 
ing financial market network for a group of companies, 
one widespread technique is to obtain the correlation net¬ 
works of the stocks exchanged between them |17j. Such 
inferred networks tend to convey many relevant proper¬ 
ties of the actual financial network established by the 
financial market. In addition, since companies can al¬ 
ways be divided into distinct subgroups, related to their 
respective sector of activity or partnerships, the analysis 
of the community structure of the financial market is of 
considerable importance to understanding its dynamics. 

The goal of this paper was to show that the community 
structure of the financial market contains an almost com¬ 
plete description of the dynamics of the system, in the 
sense that many of its characteristics can be recovered by 
knowing only the communities over a given time interval. 
We here demonstrated that by using the mixing matri¬ 
ces obtained from the inferred networks, many network 
measurements can indeed be recovered by generating a 
random network following the connectivity pattern indi¬ 
cated by these matrices. Interestingly, after the year 2002 
the inferred networks become less related to the commu¬ 
nity preserving artificial networks generated by the null 
model. This effect remains the topic of future work. We 
found no clear explanation for the mechanisms involved 
in the formation and evolution of the communities. 

The results indicate that, on many occasions, it is more 
suitable to use the community preserving null model to 
attest the statistical significance of experimental obser¬ 
vations on financial market networks. Furthermore, the 
methodology can be applied to any system where the 
underlying network can be inferred through similarity 
measurements. If the system displays a natural represen¬ 
tation based on its constituent communities, we expect 
the framework proposed here to provide an accurate de¬ 
scription of topological changes occurring in the system. 
It also remains to be studied if different pruning tech¬ 
niques EH [76], used to transform a correlation matrix 
into the adjacency matrix, can improve the results or 
provide new insights about the data. In addition, other 
datasets related to the financial market, such as inter¬ 
bank ownership, could provide additional developments 
about the relevance of community formation during pro¬ 
nounced market crises. 
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Appendix A: Supplementary Material of "Modular Dynamics of Financial Market Networks" 
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FIG. SI: Effect of the desired percentage of edges / on the dynamical modularity of the inferred network. 
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FIG. S2: Evolution of the dynamical, fixed and lagged modularity (considering 3 lag values) of the stock market network with 
time. 


































13 


<£> | 

<2002 

>2002 — inferred network 

300 

Community 

419 

2522 —Community model 


Degree dist. 

691 

1 — Degree dist. model 

250 



al 


P <2002 >2002 
Community 0.93 0.95 

Degree dist. 0.77 0.94 


>o|<^o|<^oI<^o |<^oI<^o I<^o I<^oI<^oI<^oI<^oI<^o I <^o I <^o I <^o | <^oI<^oI<^oI<^o I <^o I <^o |<^o I <^o I <^o I 
1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 


Time (Year) 



Inferred Network 





PO 2002 

Community 0.005 
Degree dist. 0.007 


— Inferred network 

— Community model 

— Degree dist. model 



1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 


£ = S |£ J 

2003 2004 


£ = S |£ J 
2005 2006 


S-8 | & 
2007 2008 


2*8 |t = § | 

2009 2010 2011 


Time (Year) 



FIG. S3: Average betweenness, clique number, rich-club coefficient and average matching index obtained for the NYSE networks 
(blue), configuration null model (green) and community null model (orange). Plots on the left show the time evolution of the 
measurements. Plots on the right show the scatter plot between each measurement obtained from the inferred networks and 
the artificially generated networks. The blue line indicates a y = x relationship. The average squared differences and Pearson 
correlation coefficients between the inferred and simulated time series are shown as tables included inside each plot. Both 
measurements are calculated separately for the ^ 1987^2002 and £> 2002^2011 periods. 































































